British Journal of Cancer — Latest Matching Preprints

1

Using colorectal cancer screening evidence to stratify for personal risk among those with a family history of colorectal cancer: a 42-year cohort study

King, D. W.; King, P. E.; Blanchard, M. W.; Ning, N. W.; King, S. K.; Grimm, M. C.; Ha, T.; Eagar, K.

2026-06-08 health systems and quality improvement 10.64898/2026.06.04.26354891 medRxiv

Top 0.1%

7.3%

Show abstract

Objective To determine if it is possible to assess individual patient risk of the development of colorectal cancer (CRC) in people in high-risk groups due to their family history. Design/Method Retrospective observational study of prospectively collected data from consecutive patients referred for a colonoscopy. 2,478 consecutive patients were referred to a single colorectal surgical practice in Sydney, Australia between 1977 and 2018 for a colonoscopy because of a family history of CRC. Of these, 1,963 have been followed for more than 10 years and are the subject of this paper. Histopathological findings categorised as normal (N), non-advanced adenoma (NAA) or advanced neoplasia (AN) with AN proven to be the precursor to CRC. Intervention Colonoscopic screening on the basis of contemporary practice to 2006 and subsequently according to Australian National Health and Medical Research Council guidelines. Results Participants with normal or low-risk findings in the first decade remain at lower risk of CRC for 30 years from the commencement of screening. Conclusion It is possible to stratify individual patients in a high relative risk cohort into those with high or low personal risk of CRC based on colonoscopic findings in the first 10 years of surveillance. Those with no AN in the first ten years have a lower 30-year risk of developing AN than the general community. This offers the possibility of structuring surveillance programs around individual risk rather than group risk, lessening the need for multiple surveillance colonoscopies in the majority of such patients and improving the cost effectiveness of CRC screening at the population level.

2

Formalising Limits of Circulating Tumour DNA Detection: A Signal Detection Framework for Clinical Threshold Specification

Walinjkar, A.

2026-06-10 oncology 10.64898/2026.06.08.26355204 medRxiv

Top 0.2%

6.7%

Show abstract

Background: Circulating tumour DNA (ctDNA) liquid biopsy is now established across oncology for early cancer detection, minimal residual disease surveillance, and treatment monitoring. Detection thresholds for all current ctDNA assays are derived empirically through receiver operating characteristic analysis on training cohorts - a statistically valid but theoretically uninformed approach that does not specify the minimum detectable tumour fraction given assay technical characteristics, nor identify when increasing sequencing depth ceases to provide additional clinical information. Methods: We model ctDNA detection as a binary hypothesis testing problem with Binomial-distributed mutant allele counts against a sequencing error noise floor. The Neyman-Pearson lemma is applied to derive the uniformly most powerful detector and the minimum detectable tumour fraction in closed form. The sequencing assay is modelled as a binary symmetric channel and Shannon channel capacity is calculated. Empirical validation uses n=61 data points extracted from five published peer-reviewed analytical validation studies across five independent institutions in the US and EU (2018 - 2025): Yu et al. 2022, Stetson et al. 2018, Frydendahl et al. 2023, Northcott et al. 2024, and Cheng et al. 2025. Results: The minimum detectable tumour fraction is derived in closed form as f_min approximately equal to (z_alpha + z_beta) multiplied by the square root of (epsilon divided by N), where N is sequencing depth, epsilon is the platform error rate, and z_alpha, z_beta are standard normal quantiles at the specified false positive and false negative rates. Shannon channel capacity is C = 1 minus H(epsilon) bits per read, where H(epsilon) is binary entropy. Empirical validation yields 84.3% agreement for single-locus assays. Discordance for multi-locus tumour-informed assays (NeXT Personal, duplex WGS) is consistent with the single-locus model scope and identifies the principal theoretical extension required. Conclusions: This framework provides the first formal Neyman-Pearson optimality proof for ctDNA detection, a closed-form detection limit, and a platform-independent efficiency metric for NHS and regulatory standardisation. Keywords: circulating tumour DNA; liquid biopsy; Neyman-Pearson detection; Shannon channel capacity; sequencing depth; limit of detection; minimal residual disease; signal detection theory

3

Integrated T-Cell Receptor Repertoire and Tumor Immunogenicity Profiling Reveals Distinct Immunogenomic States in Endometrial Cancer

Aversa, I.; Abatino, A.; Isabello, A.; Gallo, R.; Isdraele, L.; Straface, T.; Zullo, F. M.; Guida, M.; Saccone, G.; Fiume, G.; Venturella, R.; Viglietto, G.; Cuda, G.; Costanzo, F.; Zullo, F.; Palmieri, C.

2026-06-10 oncology 10.64898/2026.06.08.26355191 medRxiv

Top 0.3%

4.3%

Show abstract

Background Endometrial cancer exhibits marked molecular and immune heterogeneity that is only partially explained by established genomic biomarkers. We investigated whether T cell receptor (TCR) repertoire architecture captures complementary dimensions of antitumor immunity beyond conventional molecular classification. Methods Paired tumor and peripheral blood samples from eight patients with molecularly characterized endometrial cancer underwent TCR repertoire profiling. Diversity, clonality, and tumor blood overlap metrics were integrated with genomic variables, including tumor mutational burden (TMB), genomic instability metric (GIM), and POLE status. Principal component analysis and correlation analyses were used to identify major dimensions of repertoire organization. Composite Immune Focusing and Immune Sharing Scores were derived to summarize dominant repertoire patterns. Results The first two principal components explained 70.1% of total repertoire variance and revealed substantial heterogeneity independent of histological subtype. TMB was strongly associated with reduced repertoire diversity and increased clonal dominance, resulting in a robust association with the Immune Focusing Score ({rho} = 0.88, p = 0.004). POLE mutated tumors occupied the extreme end of this focusing continuum. In contrast, genomic instability was associated with increased tumor blood repertoire overlap and preserved diversity, reflected by a strong correlation between GIM and the Immune Sharing Score ({rho} = 0.76, p = 0.027). The two immune scores showed minimal correlation with each other ({rho} = -0.24, p = 0.57), indicating that they capture largely independent aspects of immune organization. Conclusion Integrative analysis of TCR repertoire architecture and tumor genomics identifies distinct immunogenomic states in endometrial cancer that are not fully captured by conventional molecular classification. If validated in larger cohorts, immune focusing and immune sharing metrics may provide complementary biomarkers for patient stratification and immunotherapy-oriented precision oncology

4

Incidence and Predictors of IOP-Lowering Treatment Following Detection of Referable Glaucoma in a Teleretinal Screening Program

Bolo, K.; Wong, B.; Do, J.; Ambite, J.-L.; Li, Z.; Kesselman, C.; Daskivich, L.; Xu, B.

2026-06-04 ophthalmology 10.64898/2026.06.02.26354782 medRxiv

Top 0.4%

3.7%

Show abstract

Purpose: To evaluate the incidence and baseline predictors of intraocular pressure (IOP)-lowering treatment following detection of referable glaucoma by teleretinal screening. Design: Retrospective cohort study. Methods: Participants were derived from a safety-net teleretinal diabetic retinopathy screening program (2013-2024). Participants included individuals who screened positive for referable glaucoma (cup-to-disc ratio [CDR] [≥]0.6 or CDR asymmetry [≥]0.2) and completed in-office diagnostic evaluation. The primary outcome was initiation of IOP-lowering treatment (medication, laser, or surgery) and the secondary outcome was intervention with surgery. Cumulative incidence functions were estimated, accounting for loss to follow-up. Fine-Gray models were used to identify baseline screening predictors to risk stratify each outcome. Glaucoma diagnosis was approximated using diagnostic codes and chart review. Results: 2,367 participants were included. The cumulative incidence of treatment was 19.6% (95% CI: 18.0-21.2) at Year 1 and 45.1% (42.1-48.1) at Year 8. Early treatment occurred primarily in glaucoma cases, whereas treatment accumulated longitudinally in glaucoma suspects, reaching 36.5% (31.6-41.5) by Year 8. Surgery was less common (8-year incidence: 5.3%). Baseline screening data predicted treatment and surgery, enabling risk stratification. At Year 8, cumulative incidence differed substantially between high- and low-risk groups (treatment: 59.9% vs. 31.2%; surgery: 9.7% vs. 1.0%). Older age (sub-distribution hazard ratio [SHR] 1.03 per year, p<0.001), Black race (SHR 1.50, p<0.001), and personal history of glaucoma (SHR 1.90, p<0.001) were associated with treatment; Asian race was protective (0.71, p=0.03). Older age (SHR 1.06, p<0.001), worse visual acuity (SHR 5.11 per logMAR unit, p<0.001), and screening at a hospital-based site (SHR 2.46, p=0.003) were associated with surgical treatment. Conclusion: Nearly half of safety-net diabetic patients screening positive for referable glaucoma initiated IOP-lowering treatment over 8 years, while few received surgery. Baseline screening characteristics enabled risk stratification of treatment and surgery. These findings address an evidence gap about longitudinal consequences of screening and suggest that its impact extends beyond detection of prevalent glaucoma to include identification of high-risk glaucoma suspects who warrant ongoing surveillance.

5

Title: Development of a Human Papillomavirus genotype-informed risk-stratification model to improve Cervical Cancer screening in resource-limited settings: a cross-sectional study

Kambou Kountchou, K. D. K. K.; Tommo Tchouaket, M. C.; Moko Fotso, L. G.; Fokou Bomgning, B. N.; Fippo Fitime, L.; Talom Teumadjou, A.; Routoube, M.; Efakika Gabisa, J.; Ngoufack Jagni Semengue, E.; Nka, A. D.; Kae, A. C.; Dobgima Pisoh, W.; Deutou, L.; Takou, D.; Fainguem, N.; Sosso, S. M.; Kamgaing Simo, R.; Yagai, B.; Tabola Fossa, L.; Perno, C.-F.; Colizzi, V.; Enow-Orock, G.; Fokam, J.; Terrinoni, A.; Kuiate, J.-R.

2026-06-10 pathology 10.64898/2026.06.06.26355059 medRxiv

Top 0.4%

3.7%

Show abstract

Background: In resource-limited settings, a critical bottleneck in cervical cancer prevention is the lack of practical strategies to triage high-risk human papillomavirus (HR-HPV)- positive women. Therefore, this study aimed to develop and internally validate a genotype-specific risk stratification model. Methods: A cross-sectional study enrolled 555 women in Cameroon. Data collection integrated cervical cytology and HPV genotyping using Abbott m2000rt and Sacace multiplex systems. An iterative modeling approach with bootstrap validation was used to develop the model and address model instability. HR-HPV genotypes were transformed into a hierarchical risk variable due to sparsity and integrated with significant predictors. The final model was translated into a scoring system, and the risk gradients and performances were evaluated at two thresholds. Data was analyzed using SPSS 27.0. Results: The mean age was 44.8 years, and the prevalence of HR-HPV was 26.5% (147/555). The final model, incorporating HPV categories, age, and tobacco, demonstrated moderate discriminative ability (AUC=0.702, 0.642-0.762) with a good calibration (Hosmer-Lemeshow {chi}{superscript 2}=4.05, p=0.399). The scoring system assigned women to risk groups based on their total scores which produced a clear monotonic risk gradient; the observed probability of high-grade lesions/cancer ranged from 15% (score 0) to >65% (score [≥]4). At a conservative threshold ([≥]4 points), 4.7% (26/555) of women were classified as high-risk, concentrating 46% (6/13) of cancers (positive predictive value[PPV]=58%) while a sensitive threshold ([≥]3 points) had 16.8% (93/555) high-risk, concentrating 77% (10/13) cancers (PPV=38%). Both thresholds maintained a high negative predictive value (>95%). Conclusion: This bootstrap-validated, risk-stratification tool is a proof-of-concept in resource limited settings that assigns HR-HPV-positive women to distinct management pathways using three variables. After refining through a longitudinal study and external validation, this scoring system can improve the efficiency of cervical cancer screening programs in low-resource settings.

6

Immunohistochemical phenotype is associated with metastatic site in breast cancer: a retrospective pathomorphological study of women from the Lower Aral Sea region, Uzbekistan

Khodjaniyazov, A. A.; Rojobov, R. R.

2026-06-08 pathology 10.64898/2026.06.05.26354969 medRxiv

Top 0.6%

2.1%

Show abstract

Background: Breast cancer is the most frequently diagnosed cancer and the leading cause of cancer death in women worldwide, and the great majority of these deaths are caused by metastatic disease. Whether the immunohistochemical (IHC) phenotype of breast cancer is associated with the anatomical site of metastasis has been characterized mainly in high-income, registry-based populations, while data from ecologically stressed and medically under-served regions such as the Lower Aral Sea basin are lacking. Methods: We retrospectively reviewed 652 women diagnosed with breast cancer at the Khorezm Branch of the Republican Specialized Scientific-Practical Medical Center of Oncology and Radiology (Uzbekistan) between 2020 and 2024, of whom 213 had metastatic disease (306 metastatic foci). Histological type was assessed on hematoxylin-eosin and van Gieson-stained sections; quantitative morphometry was performed in Fiji/ImageJ; and HER2, estrogen receptor (ER), progesterone receptor (PR) and Ki-67 were assessed by IHC. The association between marker expression and metastatic site (liver, lung, lymph node) was tested in 187 foci with adequate tissue using the chi-square test, with significance at p < 0.05. Results: Invasive ductal carcinoma predominated. Metastatic site was significantly associated with the IHC phenotype. Liver metastases showed the highest frequency of HER2 3+ (45.7%), ER-negativity (65.2%), PR-negativity (69.6%) and high proliferation (Ki-67 [≥] 60%; 47.8%), whereas lymph-node metastases were more often hormone-receptor-positive (ER+ 58.7%; PR+ 52.4%) with lower HER2 3+ (22.2%); lung metastases were intermediate (all p < 0.05). The combination of HER2 3+ and Ki-67 [≥] 60% was associated with multi-organ spread. Morphometry corroborated these patterns: liver lesions had larger atypical cells (up to 132.8 m), a higher nuclear-to-cytoplasmic ratio (0.76 vs 0.51) and more extensive necrosis and microvascularity than lymph-node lesions. A pragmatic 5-criterion morphological score (histological type, Ki-67, HER2, ER/PR status, atypical-cell size) stratified metastatic risk into three tiers. Conclusions: In this regional cohort, the IHC phenotype of breast cancer tracked the anatomical site of metastasis, with an aggressive HER2-driven, hormone-receptor-negative profile concentrated in liver metastases and a hormone-receptor-positive profile in lymph-node metastases. These findings reproduce established organotropism patterns in a previously uncharacterized population and support phenotype-aware, site-specific surveillance together with a low-cost morphological risk score for resource-limited settings.

7

Understanding Human AI Discrepancy in Breast Cancer TIL Assessment: A Multi-Rater and Perceptual Bias Study

Capar, A.; Aloglu, I.; Aker, F.; Ertano, M.; Mese, Y. E.; Ungor, A.; Yildiz, B. E.

2026-06-04 pathology 10.64898/2026.05.29.26354196 medRxiv

Top 1%

1.1%

Show abstract

Objective: Tumor-infiltrating lymphocytes (TILs) in breast cancer are one of the most important indicators of the immune response within the tumor microenvironment. They play a particularly significant prognostic and predictive role in triple-negative and HER2-positive subtypes. However, substantial inter-observer variability has been reported in TIL scoring among pathologists, which limits its reliability in clinical practice. The aim of this study was to evaluate the agreement between artificial intelligence (AI) models and pathologists in TIL scoring and to compare this agreement using different statistical approaches, thereby assessing the potential of AI integration into pathology practice. Materials and Methods: Digitized histopathological images of breast cancer cases were included in the study. Tumor regions annotated by pathologists were evaluated for both stromal TIL percentage and the proportion of stromal tumor area within each ROI, with assessments performed independently by three pathologists and two AI models. Agreement was assessed among pathologists, between pathologists and AI, and between AI models. Statistical analyses included intraclass correlation coefficient (ICC), Cohen and Fleiss kappa, correlation tests, and Bland-Altman analysis. In addition, categorical agreement was examined using different cut-off values. Results: Inter-pathologist agreement was high, with an ICC of 0.81. In contrast, the global agreement between pathologists and AI models was lower (ICC 0.41). Pairwise comparisons of pathologist-AI agreement yielded substantially lower ICC values (0.12-0.21), although this improved to 0.53 when three pathologists were assessed jointly with a single AI model. The strongest categorical agreement was observed with dichotomized TIL scores ([≤]10% vs. >10%), whereas multi-category classifications were associated with a marked reduction in kappa values. Spearman correlation coefficients between pathologists and AI models ranged from moderate to good ({rho} = 0.48-0.81). Agreement between the two AI models themselves was moderate, with an ICC of 0.64

8

Closing the Paediatric Gap: Adult-Trained AI Generalises Robustly to Paediatric Coeliac Disease Diagnosis

Jaeckle, F.; Gillett, P. M.; Kirkwood, K. J.; Natu, S.; Chan, J. Y. H.; Bateman, A. C.; Arends, M. J.; Soilleux, E. J.

2026-06-05 pathology 10.64898/2026.06.04.26354889 medRxiv

Top 1%

1.0%

Show abstract

Background Coeliac disease (CD) diagnosis on duodenal biopsies is limited by interobserver variability. We have previously demonstrated pathologist-level performance with our artificial intelligence (AI) model for the histopathological diagnosis of adult CD, but not in paediatric practice. As paediatric CD screening programmes expand internationally, accurate and scalable diagnostic tools are needed. We investigated whether an AI model trained exclusively on adult whole-slide images (WSIs) can generalise to paediatric CD diagnosis across independent centres. Methods A training and validation dataset of 9,958 WSIs from 8,421 adult patients (961 CD) from five centres was used to develop an ensemble of multiple-instance learning models using features from a foundation model. Testing was performed on 708 consecutive paediatric patients (86 CD) from two centres (Edinburgh and Southampton) not included in training. Model calibration was assessed, and probability outputs were grouped into clinically interpretable categories. Findings In adult cross-validation, the AI model achieved an area under the receiver operating characteristic curve (AUC) of 98.7%, sensitivity of 84.9%, specificity of 99.0%, and negative predictive value (NPV) of 98.1%. On testing (paediatric) datasets, performance remained high (AUC 98.8%, sensitivity 80.2%, specificity 98.4%, NPV 97.3%). Restricting analysis to predictions outside the intermediate-probability range (predicted CD probability <10% or [≥]65%; 85.3% of cases) improved sensitivity to 100% and specificity to 98.7%. No misclassifications were observed among high-confidence predictions (<2% or [≥]85%; 66.0% of cases). The expected calibration error was 0.03. Performance improved significantly when biopsies from both duodenal sites (bulb [D1] and descending [D2/3]) were considered. Interpretation Our AI model, trained on adult biopsies, generalises to paediatric CD diagnosis across centres and scanner platforms. Well-calibrated probability outputs provide clinically interpretable measures of diagnostic confidence and could support safe identification of CD-negative biopsies within defined thresholds. These findings demonstrate the feasibility of applying adult-derived AI models in paediatric populations and reinforce the importance of multi-site (D1 & D2) biopsy sampling.

9

Physical activity, fatty acids, and MASLD risk: Behavioural and metabolic factors jointly shaping liver health in populations

Chen, F.; You, R.; Liu, Y.; Yin, Y.; Liu, A.; Deng, L.; Xie, B.; Fan, J.; Wang, W.

2026-06-08 epidemiology 10.64898/2026.06.05.26354982 medRxiv

Top 1%

0.8%

Show abstract

Background and Aims: MASLD has become the most prevalent chronic liver disease globally. Although MVPA and plasma fatty acids have been individually studied in relation to metabolic health, their independent and combined associations with MASLD incidence remain unclear. We aimed to investigate these associations. Methods: This study included 51,717 UK Biobank participants free of liver disease at baseline, with MVPA measured using wrist-worn accelerometers and plasma fatty acids quantified via NMR. Multivariable-adjusted Cox models and restricted cubic splines were used. Results: Over a median follow-up of 7.8 years, 472 incident cases were identified. In fully adjusted models, meeting recommended MVPA levels together with higher n-6 PUFA concentrations was associated with a 71% lower risk (HR 0.29, 95% CI 0.18-0.45). The MVPA-MASLD association was nonlinear, with risk reduction plateauing at approximately 189 minutes per week. Higher n-6 PUFA was associated with reduced risk, whereas n-3 PUFA showed no significant association. Conclusions: These findings suggest that behavioral and metabolic factors may jointly influence MASLD risk. Further studies in diverse populations are needed to confirm these associations.

10

Lung cancer pathway inequalities for adults with severe mental health conditions: A mixed-methods analysis of barriers to screening and care pathways in South East London

Tredget, G.; Milenova, M.; Parkash, R.; McGrath, R.; Edwards, M. J.; Gee, S.; Pigg, W.; Karwacki, D.; Costa, C.; Shafique, S.; Adams, M.; Waghorn, J.; I'Anson, D.; Ronaldson, A.; Haire, K.; Githuku, C.; Beveridge, E.; Williams, J.

2026-06-09 oncology 10.64898/2026.06.08.26355143 medRxiv

Top 1%

0.8%

Show abstract

Background: Adults with severe mental health conditions (often referred to as severe mental illness, SMI) experience 15 to 20 year mortality gap relative to the general population, with lung cancer a significant contributor. National cancer policy targets earlier diagnosis but does not explicitly address how pathways function for this group. Aims: This study aimed to describe lung cancer risk, prevalence, screening eligibility, referral activity and diagnostic pathway performance for adults with SMI in South East London (SEL), and to examine where along the pathway inequalities arise. Methods: Co-designed with experts with lived experience and voluntary sector, this exploratory mixed-methods service evaluation combined quantitative analysis of routinely collected data from the Quality Outcomes Framework (QOF), SMI Register and Cancer Waiting Times Record (April 2023-March 2024) with semi-structured qualitative interviews (n=11 clinical staff) and focus groups (n=6 adults with lived experience of SMI). Quantitative and qualitative data were analysed using descriptive statistics and framework-based thematic analysis respectively, and findings were integrated using a joint display approach, organised by the Consolidated Framework for Implementation Research (CFIR). Results: Lung cancer prevalence was approximately double among adults with SMI (0.17% vs 0.09% in the general population). Despite Urgent Suspected Cancer (USC) referral rates being more than twice as high in the SMI population (63 vs 28 per 100,000), fewer cancers were detected via planned general practice (GP) routes (11% vs 20%), the 28-day Faster Diagnosis Standard was not met for any SMI patient diagnosed with lung cancer during the study period; overall FDS performance was 76% in the SMI population compared with 84% in the general population; and appointment non-attendance was more than double that in the general population (6% vs 3%). Qualitative findings identified individual, service and system-level mechanisms, including stigma, diagnostic overshadowing, fragmented coordination, and rigid pathway protocols, that compound disadvantage across lung cancer pathway stages. Conclusions: Inequality in lung cancer outcomes for adults with SMI accumulates across the pathway rather than arising at a single point of failure. Addressing this requires proportionate adaptations within existing cancer pathways, alongside routine reporting of cancer outcomes stratified by SMI population. Keywords: severe mental health conditions, lung cancer, health inequalities, cancer screening, diagnostic pathway, mixed methods

11

Breast cancer polygenic risk score performance varies by socioeconomic status

Domian, H. I.; Tian, X.; Ong, D.; Hamilton, L.; Shieh, Y.; Musharoff, S. A.

2026-06-04 genetic and genomic medicine 10.64898/2026.06.03.26354819 medRxiv

Top 2%

0.6%

Show abstract

Background: Polygenic risk scores (PRS) for breast cancer are increasingly used for risk stratification to inform screening and prevention. However, for PRSs to be equitable and clinically useful, they need to perform well across diverse populations. While PRS performance is known to be ancestry-dependent, it is not well understood how environmental context, such as that of socioeconomic status (SES), affects PRS transferability. Here, we assess whether SES, measured via self-reported household income, modifies breast cancer PRS performance and, if so, whether socioeconomic context contributes predictive information beyond genetic risk alone. Methods: We used the US-based All of Us biobank to evaluate how SES impacts breast cancer PRS performance. First, we quantified changes in breast cancer PRS performance by modeling a commonly-cited polygenic score for breast cancer previously described by Mavaddat et al. with SES. We then reestimated the genetic effect sizes of the 3,820 variants from Mavaddat et al. in All of Us with and without income as a covariate. Because social determinants of health affect breast cancer detection and outcomes, we stratified analyses by socially defined populations on the basis of self-identified race and ethnicity. We further stratified individuals whose self-identified race is White (''White'') into three SES groups (high, middle, low) based on self-reported income and re-estimated genetic effect sizes to create SES-specific PRSs. We then applied these PRSs to White participants, the largest group in the study, and to Black or African American (''Black'') and Hispanic or Latino (''Hispanic'') participants, groups underrepresented in breast cancer research. Model discrimination between cases and controls was measured by area under the curve (AUC). Results: We analyzed 163,715 women from the All of Us biobank, which included 8,833 breast cancer cases (6,619 White, 1,178 Black, and 1,036 Hispanic), with relative income available for a subset of these cases (5,525 White, 848 Black, and 566 Hispanic). The ancestry-dependent performance of the breast cancer PRS described in Mavaddat et al. was replicated in All of Us. In Black individuals, this PRS (AUC and 95% CI: 0.576 [0.571, 0.582]) produced a similar increase in AUC as relative income (AUC: 0.573 [0.568, 0.577]) when added to an age-only model. Incorporating income with PRS, age, and genetic PCs 1-3 improved AUC by 0.007 in White Americans and 0.018 in Black Americans (both p < 10-11), while attenuating the contribution of PRS in the full model. PRS performance also varied among SES categories. Notably, PRSs with variant effect sizes that were recalibrated in low-SES White participants performed best in low-SES White participants (AUC: 0.605 [0.583, 0.628]) and Black Americans (AUC: 0.588 [0.586, 0.591]), both better than performance in high-SES White Americans (AUC: 0.579 [0.577, 0.580]) and middle-SES White Americans (AUC: 0.578 [0.569, 0.586]). Conclusion: Socioeconomic context, measured by income, significantly impacts the transferability of a PRS for breast cancer within and among groups defined by self-identified race and ethnicity. Accounting for SES improves PRS performance, most notably in Black Americans and low-SES White individuals.

12

Precision Imaging to Evaluate Kaposi Sarcoma (PRIME-KS): protocol for a multicountry novel artificial intelligence-based imaging device

Odeny, T. A.; Adhiambo, H. F.; Mangale, D.; Makanga, P. K.; Odeny, B.; Okuku, F.; Zhou, C.; Geng, E.; Carson, J.; Mudhune, V.; Bukusi, E.; Semeere, A.

2026-06-04 oncology 10.64898/2026.06.03.26354815 medRxiv

Top 2%

0.5%

Show abstract

Abstract Background: Kaposi sarcoma (KS) is the most common cancer among men in several Eastern African countries, yet treatment monitoring relies on imprecise, time-consuming ruler-based measurements defined by the AIDS Clinical Trial Group (ACTG). This method suffers from inter-observer variability, fails to capture lesion height or true geometric area, and performs poorly on dark skin. SkinScan3D (SS3D) is a portable, low-cost, AI-enabled 3D imaging device that provides objective measurements of KS skin lesion area, height, volume, and color. The Precision Imaging to Evaluate Kaposi Sarcoma (PRIME-KS) study evaluates whether SS3D provides more reproducible and accurate lesion measurements than the standard method, and validates its integration into routine clinical workflows in Kenya and Uganda. Methods: PRIME-KS is a multicountry prospective mixed-methods study with two clinical objectives. Objective 1 is a cross-sectional diagnostic accuracy study comparing SS3D with ruler-based measurement in 50 adults with KS (150 lesions) across sites in Kenya and Uganda. Two clinicians independently measure three lesions per participant using both methods. The primary outcomes are concordance correlation coefficient (CCC) for inter-rater reproducibility, and co-efficient of determination for accuracy. Objective 2 is a non-randomized before-and-after pilot study in 100 patients at three sites, evaluating device usability, acceptability, appropriateness, and feasibility using validated instruments, along with time-and-motion studies and activity-based micro-costing. Prior to these clinical objectives, a formative study used focus group discussions, discrete choice experiments, and human-centered design workshops to refine the SS3D device and protocols with end-user input. Discussion: PRIME-KS will provide the first rigorous evaluation of a 3D imaging device for monitoring KS treatment response in routine clinical settings. If SS3D demonstrates superior reproducibility and clinical utility, it could reduce unnecessary chemotherapy exposure and associated toxicities by enabling earlier, more objective assessment of treatment response. Trial registration: ClinicalTrials.gov NCT06898203, registered 27 March 2025. Pan African Clinical Trials Registry PACTR202603523439856. Keywords Kaposi sarcoma, SkinScan3D, 3D imaging, treatment monitoring, diagnostic accuracy, implementation science, usability, human-centered design, Kenya, Uganda

13

Contextualizing the Utility of Polygenic Risk Scores using Absolute Risk Models in Diverse Ancestry Populations

Chatterjee, N.; Martina, F.; Kachuri, L.; Natarajan, P.; Witte, J.; Huo, D.

2026-06-04 genetic and genomic medicine 10.64898/2026.06.03.26354842 medRxiv

Top 2%

0.5%

Show abstract

Polygenic risk scores (PRSs) are emerging as powerful tools for quantifying inherited risk for common diseases and, in some cases, are approaching clinical implementation. A major concern for PRS implementation is their limited accuracy in non-European populations, particularly in those of African ancestry. However, past evaluations have focused on metrics such as relative risk or AUC, which do not capture background risk arising from contextual factors. We introduce a novel measure of variable importance, the conditional average derivative estimator (CADE), to evaluate PRS utility across diverse contexts and populations within absolute risk models that integrate PRSs with other relevant risk factors. We illustrate this framework by integrating PRSs for breast and prostate cancer within age-specific absolute risk models for incidence and mortality fit using individual-level data from the All of Us Research Program with inputs from the National Cancer Institute SEER cancer registry. Our projections show that although the PRSs are known to have the lowest discriminatory accuracy in African Americans (AA), there are contexts in which they provide greater utility, such as for the stratification of prostate cancer risk and mortality, where the CADE values for AA were 2- and 7-fold higher than for European Americans. These findings suggest that conclusions about the limited clinical utility of PRS in non-European populations may be premature and underscore the need to quantify PRS risk-stratification utility at the absolute-risk level, while accounting for disease onset, survival, and broader health and economic factors.

14

Documented clinical genetic testing among carriers of hereditary breast and ovarian cancer variants: Ancestry and socioeconomic disparities in the All of Us research program

Yerukala Sathipati, S.; Scott, H.

2026-06-10 oncology 10.64898/2026.06.09.26355262 medRxiv

Top 2%

0.4%

Show abstract

Importance: Hereditary breast and ovarian cancer (HBOC) variant carriers benefit from risk-reducing interventions, but only if identified. The extent to which carriers are clinically recognized, and whether recognition is equitable across diverse populations, is poorly characterized in a single large U.S. cohort. Objective: To estimate P/LP HBOC carrier prevalence across genetic ancestry groups, quantify documented clinical genetic testing among carriers, and evaluate ancestry and socioeconomic disparities in testing. Design, Setting, and Participants: Cross-sectional analysis of the All of Us Research Program Controlled Tier (Curated Data Repository v8/C2024Q3R9), comprising participants with short-read whole genome sequencing and linked electronic health record (EHR) and survey data. Carriers were ascertained from research genomic data independent of clinical testing. Exposures: Genetically inferred ancestry (African [AFR], Admixed American [AMR], East Asian [EAS], European [EUR], Middle Eastern [MID], South Asian [SAS]); self-reported household income and educational attainment. Main Outcomes and Measures: (1) Carrier prevalence with Wilson 95% CIs; (2) documented clinical genetic testing (procedure codes) among carriers; (3) adjusted odds of documented testing among women, by ancestry, before and after socioeconomic adjustment, using multivariable logistic regression. Results: Among 414,830 participants, P/LP HBOC carrier prevalence was 1.42% (95% CI, 1.38-1.45) overall and similar across ancestry groups (AFR 1.24%, AMR 1.32%, EAS 1.19%, EUR 1.52%, MID 1.68%, SAS 1.33%; overlapping CIs). Among 250,071 women in the testing analysis, documented clinical genetic testing was rare: only 74 of 5,878 carriers overall (1.3%) and 59 of 3,572 European-ancestry carriers (1.7%) had a documented test, with counts below reportable thresholds in all other ancestry groups. African-ancestry women had lower adjusted odds of documented testing than European-ancestry women (Model 1 adjusted odds ratio [aOR], 0.32; 95% CI, 0.27-0.39), an association that attenuated but persisted after adjustment for income and education (Model 2 aOR, 0.48; 95% CI, 0.40-0.58; P < 0.001); Admixed American women also had reduced adjusted odds (aOR, 0.71; 95% CI, 0.61-0.84). Lower income and lower education were independently and dose-dependently associated with lower testing odds (income <$25,000 aOR, 0.46; high-school education aOR, 0.54). Conclusions and Relevance: High-risk HBOC variant carriers are present across all ancestry groups at similar frequencies, yet documented clinical genetic testing was disparate in the different ancestry groups. African-ancestry women experience a testing gap that is not fully explained by socioeconomic position, implicating structural barriers in access and referral. Population-level strategies that decouple carrier identification from current referral pathways may be required to close this gap.

15

Bias from small-count suppression in county-level cancer disparity estimates: a calibrated simulation study

gahan, k.

2026-06-08 epidemiology 10.64898/2026.06.05.26355021 medRxiv

Top 3%

0.3%

Show abstract

Abstract Background. Area-level cancer disparities are routinely estimated from public county data in which rates based on small counts (fewer than 16 cases or deaths) are suppressed. Analysts typically drop suppressed counties (complete-case analysis). Because suppression depends on case counts tied to population size and demographic composition, this missingness may be informative, but its effect on the disparity estimate has not, to our knowledge, been quantified. Methods. In a cross-sectional ecological study of 3,143 U.S. counties (analytic sample 3,018 with computable exposure) using one frozen public release of NCI State Cancer Profiles incidence and mortality data and ACS 2018-2022 5-year data, we estimated the most- versus least-deprived ICE(race+income) quintile rate ratio (RR) and rate difference for female breast, stomach, and cervix cancers under four suppression-handling methods: complete-case, available-case, bounding, and model-based small-area estimation. We characterized which counties were erased, and, following the ADEMP framework, ran a Monte Carlo simulation (1,000 replicates per cell; Monte Carlo standard error of bias approximately 0.0025) calibrated to the release to measure bias against a known truth. Analyses were pre-registered. Results. The suppressed fraction rose with rarity: 7.4% of counties for breast, 61.3% for stomach, and 75.7% for cervix incidence. Suppression was concentrated in the most-deprived quintile (cervix, 81.8% suppressed vs 63.8% least-deprived) and overwhelmingly removed rural rather than minority residents (cervix: 81% of the rural but 9% of the minority population erased). For breast (little suppression) the RR was 0.87 (95% CI 0.85-0.89) and identical across methods; for cervix incidence the complete-case RR (1.56) exceeded the model-based estimate (1.50), and for cervix mortality (91% suppressed) complete-case (1.86) exceeded model-based (1.56) by 16% with a wide bounding interval (1.88-2.62). In calibrated simulation, population-weighted complete-case bias was small (less than 2%) at the observed deprivation-county-size correlation and grew with rarity, threshold, and unweighted aggregation; its direction was conditional, becoming positive (over-estimation) as deprived counties became smaller. Conclusions. Complete-case handling of suppressed counties over-estimates rare-cancer area disparities relative to methods that retain them, while silently erasing most of the rural and most-deprived communities the estimate is meant to represent. The effect is negligible for common cancers and grows with rarity. Public-data disparity analyses should report the suppressed fraction and use bounded or model-based estimates by default. Keywords: cancer disparities; small-count suppression; Index of Concentration at the Extremes; informative missingness; small-area estimation; rural health.

16

Dementia and Frailty Impact Postoperative Care Trajectories and Burden among Older Adults Undergoing Radical Cystectomy for Bladder Cancer

Ernandez, J.; Xiang, L.; Adler, R.; Hsu, J.; Shah, S. K.; Kim, D.; Gershman, B.; Mossanen, M.; Weissman, J. S.

2026-06-06 urology 10.64898/2026.06.04.26354768 medRxiv

Top 3%

0.2%

Show abstract

OBJECTIVE: Bladder cancer (BC) is predominantly a disease of older, comorbid adults, and radical cystectomy (RC), which is the gold standard treatment, carries considerable morbidity. We sought to determine the impact of baseline dementia and frailty on the care trajectory beyond the immediate postoperative period. We hypothesized that frail patients and those with dementia undergoing RC for BC will have poorer care trajectories. METHODS AND MATERIALS: We identified Medicare beneficiaries [≥] 66 years old who underwent RC for BC in 2017 with 12 months of pre- and post-RC enrollment. Frailty and dementia were characterized using validated, claims-based measures. Associations between baseline frailty and dementia with postoperative care trajectory outcomes were determined using Fine-Gray competing risk models. RESULTS: We identified 3,600 beneficiaries of whom 11.6% were frail and 3.4% met criteria for dementia. Patients with dementia were more likely to be frail, comorbid, and not receive standard-of-care neoadjuvant chemotherapy. Frailty was independently associated with [≥] 2 transitions in care level after index discharge from RC and skilled nursing facility (SNF) admissions within 1 year of RC, exposure to intensive post-RC interventions, including dialysis and feeding tube placement, and poorer survival. Dementia remained associated with SNF admissions regardless of frailty level. CONCLUSIONS: Among a contemporary cohort of older adults undergoing RC for BC, preoperative dementia and frailty were independently associated with poorer care trajectory beyond the immediate postoperative period after RC. Our work highlights a role for preoperative geriatric assessment in identifying and optimizing patients at greatest risk.

17

Large Language Models in Healthcare Simulation Education: A Bibliometric Analysis with AI-Assisted Screening

Pears, M.; Wadhwa, K.; Payne, S. R.; Konstantinidis, S. T. H.; Biyani, C. S.

2026-06-04 urology 10.64898/2026.06.02.26354722 medRxiv

Top 3%

0.2%

Show abstract

Large language models (LLMs) such as ChatGPT are rapidly reshaping healthcare education and simulation-based training in non-technical skills (NTS), yet no bibliometric analysis has mapped this landscape. We searched seven open-access databases (OpenAlex, PubMed, Europe PMC, Crossref, Semantic Scholar, CORE, DOAJ) for English-language publications from January 2020 to March 2026. From 100,277 initial records, a sequential keyword funnel yielded 830 candidate papers, which were screened by 83 independent Claude Sonnet 4.6 AI agents applying pre-specified inclusion criteria (PRISMA-trAIce compliant; Cohen's kappa = 0.86 pre-reconciliation, 1.0 post-reconciliation). The final AI-verified corpus comprised 551 papers with a compound annual growth rate of 109%, contributions from 2,398 authors across 279 journals in 58 countries, and an h-index of 41. ChatGPT dominated the model landscape (46% of papers), with open-source models virtually absent. Virtual patient chatbots were the leading simulation modality (106 papers). Among NTS domains, communication (145 papers) and decision-making (135 papers) were most studied, whereas teamwork, leadership, situational awareness, and crisis resource management were markedly underrepresented. Only 6 urology-relevant papers were identified, none examining LLM integration within boot camp training formats. The field is growing at extraordinary pace but remains concentrated in a narrow range of NTS domains and a single proprietary model. Critical gaps persist in team-based skills training, open-source model evaluation, and specialty-specific simulation. AI-assisted bibliometric screening using multiple independent agents is feasible, reliable, and scalable, offering a replicable methodology for mapping fast-evolving research fields.

18

Quantifying Cancer Clinical Trial Eligibility Using Artificial Intelligence-Based Matching

Goel, K. P.; Myall, N. J.; Dickerson, J.; Caswell-Jin, J. L.; Johnson, T.; Worth, J. E.; Gensheimer, M. F.

2026-06-05 oncology 10.64898/2026.06.03.26354859 medRxiv

Top 4%

0.2%

Show abstract

PURPOSE: To develop and validate an artificial intelligence-enabled platform that converts unstructured cancer trial eligibility criteria into structured queries and quantifies trial eligibility across advanced/metastatic cancer trials. METHODS: We downloaded actively recruiting US interventional treatment trials for advanced/metastatic breast cancer, colon cancer, and non-small cell lung cancer from ClinicalTrials.gov. Medical oncologists created 24 synthetic patient vignettes. A large language model converted trial eligibility criteria into Structured Query Language (SQL) code and patient information into structured records, enabling automated matching. Cancer details and treatment history were considered, but not laboratory results or comorbidities. Validation included physician editing of generated eligibility code for 30 trials, and blinded physician eligibility assessment for five trials. We then evaluated how age, ECOG performance status, sex, and ZIP code affected the number of eligible trials. RESULTS: Of 833 candidate trials, 746 met inclusion criteria. In physician review of 30 trials, edits to generated SQL did not change any of 720 trial-patient eligibility determinations for 24 synthetic patients. In blinded validation across 120 trial-patient pairs, automated matching achieved 97% accuracy. Across synthetic patients, eligible trials ranged from 31 to 258 when there were no geographic restrictions. Eligibility decreased markedly with worse performance status and with geographic restriction (both p<0.001). Later-phase, randomized, and molecularly selective trials had fewer eligible patients. CONCLUSION: AI-based structuring of trial eligibility criteria can support accurate, scalable measurement of potential cancer trial eligibility. In this demonstration, performance status, geography, and age were major determinants of eligibility across the active metastatic trial landscape.

19

Burden of Chronic Kidney Disease in China, 1990-2021: Findings from the 2021 Global Burden of Disease Study

Wang, M.; Zhao, T.; Wang, H.; Hou, S.; Fu, Y.

2026-06-09 epidemiology 10.64898/2026.06.06.26355056 medRxiv

Top 4%

0.1%

Show abstract

Introduction: To investigate the epidemiological characteristics of chronic kidney diseases (CKD) in China in 2021 and its trends between 1990 and 2021, in the context of significant population growth and lifestyle changes over the past 30 years that have likely influenced the CKD spectrum. Methods: Data on CKD prevalence, mortality, disability-adjusted life-years (DALY), and risk factors were obtained from the Global Burden of Disease Study 2021. The estimated decadal percentage changes were calculated to evaluate changes in trends in prevalence, mortality and disease burden. Results: In 2021, an estimated 118.4 (95% UI 109.4 to 127.5) million people in China were affected by CKD, contributing to 204 230 (95% UI 164 736 to 246 372) deaths and 6.13 (95% UI 5.18 to 7.21) million DALY. Although CKD due to diabetes mellitus and hypertension accounted for less than a quarter of all cases, they were responsible for over 90% of CKD-related deaths. Over the past three decades, CKD mortality and DALY rates have steadily increased, although the prevalence has stabilized in the last decade. Diabetes mellitus type 2 and hypertension have emerged as key drivers of CKD burden in China. Conclusions: The CKD burden in China shows a dual pattern of rising incidence and high mortality from diabetes and hypertension-related chronic kidney disease, alongside persistently high years lived with disability from glomerulonephritis and other causes.

20

Serum Cotinine and Wrist-Worn Ambient Light Exposure Patterns in U.S. Adults: A Cross-Sectional Analysis of NHANES 2011-2014

Wong, A.; Lee, C. W.; Park, A.; Yin, L.; Choi, Y.

2026-06-04 epidemiology 10.64898/2026.06.02.26354759 medRxiv

Top 4%

0.1%

Show abstract

Background. Tobacco smoke exposure, quantified by serum cotinine, is associated with cardiovascular, metabolic, and sleep-related health risks. The relationship between biomarker-verified tobacco smoke exposure and objectively measured, free-living wrist-worn ambient light patterns has not been examined in a nationally representative U.S. adult sample. Methods. We analyzed NHANES 2011-2014 cross-sectional data from 6,937 adults aged >20 years with valid serum cotinine and wrist-worn Physical Activity Monitor (PAM) ambient light data. Seven light outcomes were modeled using survey-weighted linear regression with log2(cotinine+1) as the continuous exposure across four covariate adjustment levels. Benjamini-Hochberg false discovery rate (FDR) correction was applied across the 7 outcomes within each model. Results. In Model 2 (adjusted for age, sex, race/ethnicity, education, poverty-income ratio, BMI, and survey cycle; N = 6,350), higher serum cotinine was associated with significantly higher nighttime light (beta = +0.024, 95% CI: 0.010, 0.038; p-FDR = 0.014) and lower evening light (beta = -0.031, 95% CI: -0.055, -0.008; p-FDR = 0.042). In exploratory behavioral models without alcohol (Model 3a; N = 5,766), both nighttime and evening associations remained FDR-significant. After additional adjustment for alcohol, which substantially reduced the sample due to 37.6% missingness (Model 3b; N = 3,866), the nighttime association attenuated below the FDR threshold, while the evening association remained FDR-significant. Categorical analyses showed progressively higher nighttime light across cotinine groups, and a hypothesis-generating sex interaction was identified (p-interaction = 0.001). Conclusions. Higher serum cotinine concentrations were associated with higher nighttime and lower evening ambient light after sociodemographic adjustment. Attenuation after behavioral adjustment and the cross-sectional design preclude causal inference. Longitudinal studies with formal mediation analyses are needed to clarify the temporal ordering and mechanisms linking tobacco smoke exposure, smoking-related behaviors, and personal light-dark cycle patterns.